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Abstract 

Fractional repetition (FR) codes are a family of repair-efficient storage codes that provide exact and 
uncoded node repair at the minimum bandwidth regenerating point. The advantageous repair properties are 
achieved by a tailor-made two-layer encoding scheme which concatenates an outer maximum-distance- 
separable (MDS) code and an inner repetition code. In this paper, we generalize the application of FR 
codes and propose heterogeneous fractional repetition (HFR) code, which is adaptable to the scenario 
where the repetition degrees of coded packets are different. We provide explicit code constructions by 
utilizing group divisible designs, which allow the design of HFR codes over a large range of parameters. 
The constructed codes achieve the system storage capacity under random access repair and have multiple 
repair alternatives for node failures. Further, we take advantage of the systematic feature of MDS codes 
and present a novel design framework of HFR codes, in which storage nodes can be wisely partitioned 
into clusters such that data reconstruction time can be reduced when contacting nodes in the same cluster. 
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I. Introduction 

Cloud storage has emerged in recent years as a promising paradigm for storing large amounts of data, 
and is currently backed by practical cloud storage services such as Amazon S3 and Windows Azure. In a 
cloud storage system, the original data file is spread across a large number of distinct nodes such that a 
client can access the data even in the presence of node failures. This distributed storage manner has been 
widely employed in enterprise systems such as Google File System (GFS) 121, Hadoop Distributed File 
System (HDFS) |j3l, etc. However, the storage nodes in the cloud system have reached such a massive 
scale that failures are the norm rather than exception. In order to provide reliable storage over unreliable 
nodes, redundancy should be introduced to cloud storage systems. The simplest strategy is replication in 
which the file is available as long as one of the copies remains intact. For example, each data chunk in 
GFS is replicated several times, with the minimum being three 12 ■ 

For the same level of redundancy, erasure coding techniques can improve data reliability when compared 
to replication Q- In an erasure coding mechanism, the original data object is encoded and spread over n 
different nodes in a distributed fashion. A data collector (DC) can reconstruct the source file by attaching 
to arbitrary k out of the n storage nodes. In other words, the system can tolerate up to n — A: concurrent 
node failures without data loss. Reed-Solomon (RS) code for instance, a classical kind of MDS codes, has 
been implemented on top of the HDFS for old data archiving Q- Upon failure of a node, the conventional 
method is to reconstruct the entire source file, re-encode and deliver the encoded block to a replacement 
node. This process, termed node repair, results in large bandwidth overheads since only a small fraction 
of the downloaded data is stored eventually. 

Regenerating code |j6l is a class of erasure codes with the capability to minimize the amount of data 
stored per node (denoted by a) and the bandwidth consumption during the repair process. When a node 
fails, it can be regenerated by contacting any subset of d surviving nodes and downloading /? packets 
from each node. These d nodes that participate in the repair process are referred to as helpers. For the 
case that the data stored on the replacement node is identical to that on the failed node, the repair model 
is called exact repair. Functional repair, in contrast, differs from exact repair that the content of the new 
node need not be the same as in the failed node, as long as the functionality of the failed node is retained 
after repair. Furthermore, based on the storage-bandwidth trade-off derived in |6|, two extreme points are 
obtained by minimizing a and /3 respectively, which correspond to the minimum storage regenerating 
(MSR) point and minimum bandwidth regenerating (MBR) point. We refer the readers to [71 for more 
details of regenerating codes (e.g., explicit code constructions and performance evaluations). 


September 15, 2015 


DRAFT 


3 


Although regenerating codes can reduce the repair overhead as compared to traditional erasure codes, 
they suffer from the drawback that the repair process is of high complexity that each contacted node has to 
read all the data it stored in order to transfer a linear comhination to the replacement node. Motivated hy 
this, a simplified repair scheme at the MBR point was first introduced in |j8l and later extended to a new 
family of codes with a tahle-hased repair model, termed fractional repetition (FR) codes Q- Specifically, 
fhe repair process of FR codes is characterized hy fhe uncoded repair properly: a surviving node reads 
and delivers the exact amount of data to the replacement node without extra arithmetic operations. The 
desirable repair property is derived from the relaxation in choosing helper nodes: the condition that any d 
surviving nodes can be contacted for repair (random access repair) is relaxed to specific subsels of d nodes 
(table-based repair). Clearly, uncoded repair is optimal in terms of repair bandwidth, disk input/output 
(FO) and computational complexity, which makes FR codes attractive in practical storage systems. 

A. Main Contributions 

In this paper, we focus on a class of erasure codes that provide exact and uncoded repair at the MBR 
point. Specifically, we propose a flexible replication scheme by relaxing the constraint on repetition degree 
in conventional FR codes and taking different types of repetition degrees into consideration. We refer to 
these new codes as heterogeneous fractional repetition (HFR) codes. We present explicit constructions of 
HFR codes from combinatorial designs, in particular from group divisible designs. The designed codes 
achieve the system storage capacity under the random access repair model and allow to have multiple 
repair alternatives in the presence of node failures. Furthermore, we present a novel design framework of 
HFR codes in which the outer MDS code is always made systematic (i.e., one copy of the original data 
remains in uncoded form). In this framework, we wisely partition the storage nodes into several clusters 
such that in normal cases, the source file can be read directly from nodes in the same cluster. 

B. Organization 

The rest of this paper is organized as follows. Section II reviews the related works on erasure codes that 
provide uncoded repair. Section III presents the necessary notations and definitions. Section IV introduces 
the encoding scheme of HFR codes. Section V presents explicit constructions of HFR codes and a novel 
design framework based on HFR codes. Finally, Section VI concludes this paper with general remarks. 

H. Related Works 

In this section, we review a number of closely related works, such as repair-by-transfer codes IfTOll and 
fractional repetition codes |f9|. 


September 15, 2015 


DRAFT 


4 


A. Repair-by-transfer Codes 

In order to simplify the repair process of conventional regenerating codes, the idea of repair-hy-transfer 
is introduced in ifTOl . where a failed node is repaired hy simple transfer of data without the need for any 
computations. We note that the concept of repair-hy-transfer is essentially the same as uncoded repair. 
The advantageous properties of repair-hy-transfer include reduced repair complexity and minimum disk 
reads at helper nodes. In particular, for the case that d = n—1 (i.e., all the surviving nodes participate in 
the repair process), Shah et al. presented in lITOl an explicit construction of exact MBR codes with repair- 
hy-transfer via a fully connected graph. In a recent work ifTTl . a new class of repair-hy-transfer codes is 
proposed based on congruent transformations, in which the minimum field size and encoding complexity 
of code construction are superior as compared with ITOl . Moreover, a different line that focuses on the 
functional repair model, called functional-repair-hy-transfer regenerating code, is presented in ifT^ and 
implemented in practical cloud storage systems |[T3l . 

B. Fractional Repetition Codes 

Fractional repetition codes are a class of erasure codes characterized hy an exact and uncoded repair 
regime. A tailor-made two-layer encoding process guarantees the desirable repair properties. Specifically, 
a source file is faken as an input to a pre-determined MDS code, where several coded packets are generated 
as output. Each coded packet is then replicated p times and distributed over distinct storage nodes. The 
policy that specifies the placement of all coded packets, is called the repetition code. As shown in ||9l, 
an FR code with repetition degree p can tolerate up to p — 1 concurrent node failures without loss of the 
exact and uncoded repair property. 

We illustrate the construction of FR codes via a simple example. Consider a file composed of M = 6 
packets, we adopt a (7, 6) MDS code that outputs 7 coded packets indexed from 1 to 7. Each packet is 
replicated p = 3 times and spread over n = 7 distinct nodes, as shown in Figure [T] Any DC who contacts 
a subset of k = 3 nodes can obtain at least 6 distinct packets, which can be used to reconstruct the source 
file. To illustrate the core idea of exact and uncoded repair, let us assume without loss of generality that 
node is failed, the lost packets can be recovered directly by downloading the corresponding replicas 
stored on other d = 3 surviving nodes (e.g., nodes N 2 , and N 4 ). It is noteworthy that the repairs of 
FR codes are table-based ||9l, which implies that a specific set of nodes (rather than any d nodes) need 
to be contacted, depending on which node has failed. 

Note that MDS codes have been historically investigated and there exist many available constructions 
in the literature, the main challenge of FR codes lies in the design of the inner repetition code. Existing 
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Storage Nodes 


Fig. 1. A fractional repetition code with repetition degree p = 3. 

constructions of FR codes are primarily derived from combinatorial designs. In the pioneer work I®, an 
explicit code construction is presented based on regular graphs for the case of single node failure and 
Steiner systems are adopted in the scenario of multiple failures. In ifTdll . Koo and Gill considered projective 
geometries (e.g., bipartite cage graphs and mutually orthogonal Latin squares) in the construction of FR 
codes. Subsequently, the authors explored in ifTSll code constructions using resolvable designs, where the 
packet repetition degree can be varied over a large range. They further considered the usage of Kronecker 
product to devise new FR codes from existing codes ifT^ . FR codes based on combinatorial configurations 
are discussed in lITT]! . Further, optimal code constructions that attain the bounds on the fractional repetition 
capacity are presented in lITSl . which are based on graph theory (e.g., biregular graphs, projective planes 
and generalized polygons). 

Some recent studies propose codes with uncoded repair for heterogeneous systems, where the storage 
capacities of different nodes may not be the same. Gupta et al. llT9l introduce weak fractional repetition 
codes, where each storage node contains different number of packets. Based on the partial regular graph, 
their construction can tolerate single node failure. Yu et al. |[20l present irregular fractional repetition 
codes for heterogeneous storage networks, which can be used to minimize the system repair cost. In our 
recent work 1211, we propose general fractional repetition codes, which are adaptable to heterogeneous 
storage capacities and multiple node failures. 

While prior works assume that each packet is replicated the same times after the outer MDS code, we 
consider a different dimension in this paper that the repetition degrees of coded packets can be different. 
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III. Preliminaries 

A. Distributed Storage Systems 

Consider a distributed storage system (DSS) with parameters (n, k, d), where n is the total number of 
nodes in the system, k is the number of nodes that a data collector needs to contact for data reconstruction 
and d is the repair degree that indicates the number of nodes a replacement node has to contact during 
the repair process. In normal cases, a system designer should choose the parameters that satisfy 

k < d < n — 1. (1) 

In this paper, we focus on exact repair where the regenerated packets are identical to the lost packets. 
During the repair process, a replacement node downloads and stores exactly one packet (i.e., /3 = 1) each 
from d helper nodes it contacts, which implies that the storage capacity is d for every node in the system. 
Under this model, the storage capacity of the DSS, denoted by Cmbr, is defined as the maximum file 
size that can be delivered by contacting any k out of the n nodes at the MBR point. In particular, for 
the case that /3 = 1, the storage capacity Cmbr of an {n,k,d) DSS is shown in 161 to be 


Cmbr = kd 



( 2 ) 


It is noteworthy that this expression is derived from the functional repair model. However, Rashmi et 
al. showed in lIH that MBR codes with exact repair can still achieve the capacity Cmbr of (H). 


B. Group Divisible Designs 

Definition 1. A combinatorial design (or simply a design) is a pair where V is a set of elements 

called points, and is a collection (i.e., multi-set) of nonempty subsets of V called blocks. 

Group divisible design (GDD) is an important academic branch of the design theory. In this subsection, 
we briefly present the definitions and properties of GDDs, which will be used in our constructions. For 
an overview of design theory, the reader is referred to 1221 and ll23l . 

Definition 2. Let 'F and 0 denote sets of positive integers. A group divisible design of order v (denoted 
by 'F-GDD) is a triple where V is a finite set (the point set) of cardinality v, Q is & partition 

of V into parts (groups) whose sizes lie in 0, and is a family of subsets (blocks) of V that satisfy the 
following properties: 
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1) lal > 1. 

2) If i? G B, then \B\ G 

3) Every pair of points from distinct groups occurs in a unique block. 

If ^' = {'4’}, then 'k-GDD is abbreviated as V’-GDD. If u = + U202 + ... + UgOs and if there are 

Ui groups of size 6 i,i = 1,2,s, then the 'k-GDD is said to be of type For example, 

if V = 5}, a = {{!}, {2,3}, {4,5}} andS = {{2,5}, {3,4}, {1, 2,4}, {1, 3, 5}}, then {V,g,B) 

is a {2,3}-GDD of type 1^2^. Throughout this paper, we will use the exponential notation of GDDs. 

Remark 3. Let V = (V, B) be a design with points V = {pi,... ,Pv} and blocks B = {Bi,..., B},}. The 
incidence matrix A = (ajj) of is a u x 6 matrix defined by aij = 1 if p* belongs to Bj and aij = 0 
otherwise. For example, the incidence matrix of the above {2,3}-GDD is 


A = 


0 0 11 
10 10 
0 10 1 
0 110 
10 0 1 


Note that if all the blocks in a GDD are of the same size, then the points from the same group belong 
to the same number of blocks, whereas the points from different groups may occur in different number of 
blocks. Specifically, given a V'-GDD of type a point from the group of size 6 i is contained 

in Vi blocks, where 


Vi = 


Ip — 1 




(3) 


IV. Heterogeneous Fractional Repetition Codes 

In practical cloud storage systems, it is not necessary to have all the data chunks with the same repetition 
degree. Due to some special considerations, a fraction of the stored data chunks can be replicated more 
than others. We present here two observations: 

• Popularity: For a data object that consists of a large number of chunks, the popularity of each data 
chunk can be different. From a system perspective, some chunks may be frequently accessed, while 
others are rarely accessed by users. Therefore, having the popular chunks with more replicas allows 
for parallel accesses. 
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• Functionality: In a practical deployment setting, the functionalities of different packets may not be 
the same. It is desirable to have more important packets with higher repetition degrees. For example, 
the systematic packets of an MDS code deserve higher repetition degrees, since they provide direct 
data reconstruction without decoding. 

Heading towards this direction, we relax the constraint on the repetition degree in prior works and in¬ 
troduce heterogeneous fractional repetition (HFR) codes, where the repetition degrees of different packets 
can be different. Note that in analogy with FR codes, the encoding process of HFR codes is formed by 
the concatenation of two components: an outer MDS code followed by an inner heterogeneous repetition 
code. However, the repetition degrees of coded packets, which are determined by the heterogeneous 
repetition code, are not the same. 

In the encoding process of HFR codes, we first employ an outer (cj), M) MDS code to encode a source 
file consisting of M packets, where f coded packets are generated with indices {!,...,(/)} respectively. 
Further, we partition these f coded packets into A subsets Pi, P 2 ,..., P\ such that each packet from Pi 
is replicated pi times, where pi 7 ^ pj,'ii,j G {1,..., A}, i 7 ^ j. After the replication operation, all coded 
packets and the replicas are distributed over a large number of distinct storage nodes, which may spread 
over a wide geographical area. The policy that specifies the placement of all the coded packets, is called 
the heterogeneous repetition code. 

We refer to a HFR code as a A-HFR code if there exist A types of repetition degrees. It is clear that 
if A = 1, then the A-HFR code reduces to an FR code. 

A. Definition 

We present now the formal definition of HFR codes. For a positive integer n, we use [n] to denote the 
set { 1 , 2 ,... , n}. 

Definition 4. Let A and be positive integers, pi,... ,p\ be distinct positive integers, and <I>i,..., 
be a partition of [fi], A A-HFR code C for an (n, k, d) DSS, is a collection of n subsets .^ 1 ,..., of [0] 
that satisfies the following conditions: 

1) The cardinality of each is d, i.e., \Ci\ = d,l < i < n. 

2) Each point in occurs in exactly pi subsets in C, where 1 < i < A. 

3) Any pair of distinct points of [(j)] is contained in at most one subset, i.e., 1,^^ D ^j\ < 1, Vi, j G [n]. 
In the above definition, each subset corresponds to a specific storage node and the points in the subsets 

represent the indices of coded packets that are stored on the node. In other words, the node capacity is 
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given by the cardinality of the subset. Furthermore, we note that represents the packet set Pi, i.e., the 
points in are the indices of coded packets in Pj, 1 < i < A. In this sense, we obtain that 

n X 

i=l j=l 

We use Sc{k) to denote the number of guaranteed distinct packets of an HFR code C when contacting 
any k nodes, i.e., 

Sc{k) = inin |Ui 6 i^Ci|. (5) 

Kc[n],\K\=k 

For a DSS with parameters (n, k, d), Sc{k) denotes the maximum file size that can be stored using C. 
By following the same approach as @ Lemma 14], we can obtain the following upper bound on Sc{k), 

A 


Sc{k)< 


1 - 


i=i 


rzi 

(D 


( 6 ) 


The proof of ® is omitted. We refer to an FIFR code as optimal if it attains the upper bound. 


B. Incidence Matrix Viewpoint 

In this subsection, we provide an alternative viewpoint based on incidence matrix which completely 
characterizes the properties of FIFR codes. Consider a A-FIFR code C = of a symbol set 

{si,..., s<^}, we can construct a symbol-by-subset incidence matrix M with its entries mjj given as 


niij = 


1 , 

0 , 


Si £ ij') 
otherwise. 


Let Ti, Cj denote the rows and columns of M respectively, where i G [0], j € [n]. Note that the support 
of a nonzero vector o = (oi,..., o^), is the set of indices of its nonzero coordinates: supp{o) = {x\ox 7 ^ 
0}. In this sense, the first condition of HFR codes then translates to 


\supp{cj)\ = d,l < j < n. (7) 

Assume that {Ti, T 2 ,..., Tx} is a partition of {si,..., s^}, where Tk = G k G [A]. Thus, 
the second condition in Definition |4] is equivalent to that if Si 


\supp{ri)\ = pk-,1 <i < (j)- ( 8 ) 

We use Aj to denote the number of subsets containing Si and A* j/ to denote the number of subsets 
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containing Si,Sii simultaneously, where i,i' G [(/>]. Let denote the transpose of M. We thus have 

A-1,2 ■ ■ ■ 

M ■ ■ ■ 

Therefore, the third condition of HFR codes can he interpreted as 

A*,i'< G [(/)]. (9) 

We use the incidence matrix viewpoint to prove the necessary and sufficient conditions for HFR codes. 
Specifically, consider a design V wifh poinf set V and block set B, we can construct a A-HFR code hy 
setting [(j)] = V and C = H, if the following theorem suffices. 

Theorem 5. The blocks in a design T> = (V, B) form a X-HFR code if and only if the incidence matrix 
of V satisfies conditions (0-®- 

Proof: Necessify: If fhere exists a A-HFR code C, the symhol-hy-suhset incidence matrix M of C 
should satisfy conditions ©-®. By setting [f] = V and C = B, then M corresponds to the incidence 
matrix A of the design V. Thus, A subjects to conditions ©-®. 

Sufficiency: If fhe incidence mafrix of adopfed design subjects to (I7]l-®, the three conditions of A-HFR 
codes will be satisfied from the construction. Hence, the designed code is a A-HFR code by definition. 


MM^ = 




A^ 


2,1 


V. HFR Codes from Combinatorial Designs 

In this section, we first provide an explicit construction of A-HFR codes from group divisible designs. 
We further propose a novel design framework based on HFR codes. 

A. Code Constructions 

We start by presenting an example before introducing the code construction. Consider a 3-GDD of 
type T^3^, where the groups are partitioned as = {{!}, {2}, {3}, {4}, {5, 6, 7}}. Correspondingly, the 
block set is B = {{1, 2,5}, {1, 3,6}, {1,4, 7}, {2, 3, 7}, {2,4, 6}, {3, 4, 5}}. We adopt a (7,4) systematic 
MDS code to encode a data file of M = 4 packets, where the systematic packets are output with indices 
{1,..., 4} and parities are with indices {5, 6, 7} respectively. Each systematic packet is then repeated 3 
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Fig. 2. A 2-HFR code from 3-GDD of type 030 


times, while the parity packets are only repeated twice. As shown in Figure |2l if we take C = 0, we can 
place these coded packets on n = 6 nodes according to the 2-HFR code C. Furthermore, we observe that 
any DC contacting A: = 3 nodes can obtain at least 6 distinct packets. By using we can compute that 
<S'c(3) < [ 6 . 2 J = 6, which indicates that the 2-HFR code C is optimal. 

We present now explicit constructions of HFR codes from group divisible designs. Note that the nodes 
in the system, which correspond to blocks, are of the same storage capacity. We thus focus on GDDs with 
the same block size. The core idea behind the construction is that for a '0-GDD of type 
the points from different groups occur in different number of blocks. If we view the points as the indices 
of coded packets and the blocks as storage nodes, we can obtain different repetition degrees. 

Construction: Given a ^/;-GDD of type 0“^ • • • 0“^ with block set B, a A-HFR code C can be obtained 
by taking (/> = ui0i ua0a and C = B. Correspondingly, the A repetition degrees are given by 


Pi 


E^=i '>^wOw] - 0i 
il) — 1 


/ = !, 


,A. 


( 10 ) 


The constructed A-HFR code can be implemented in a DSS with parameters n = and 

d = 1 /). Note that our construction is based on the assumption that there exists a GDD of certain types, 
of which the existence results are presented in 1231 Part IV]. For instance, Abel et al. proved in the 
necessary conditions for the existence of a 5-GDD of type g^m^: 


Lemma 6. A 5-GDD of type exists if g = t) (mod 4), m = 0 (mod 4) and m < Ag/?), with the 
possible exceptions of {g,m) = (12,4) and (12,8). 


Further, since any two nodes intersect at not more than one point, we thus obtain the following theorem. 


Theorem 7. The X-HFR codes designed by the Construction above achieve the system storage capacity 
Cmbr under the random access repair model. 
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Proof: From the properties of group divisible designs, we have that any pair of points from different 
groups is contained in exactly one block. In other words, any two distinct nodes can not share more than 
one packet. Therefore, there are at most ( 2 ) repeated packets when contacting k nodes of the system, 
which suggests that HFR codes obtained by the Construction achieve the storage capacity Cmbr- ■ 

Remark 8. The metric of repair alternativity is introduced in |[25l . which measures the number of different 
subsets of nodes that enable the recovery of a failed node. We note that an erasure code with large repair 
alternatives increases the probability to recover lost data even in the presence of multiple failures. 

Due to the heterogeneity in repetition degrees, the repair alternativity of FIFR codes depends on the 
packets that stored on the failed node. Without loss of generality, we assume that the repetition degrees 
of the lost packets are pi,..., pd respectively. Since any pair of distinct points is contained in exactly 
one group or one block, the remaining replicas of all lost packets are spread over different nodes such 
that there will not exist a surviving node that contains one pair of the remaining replicas. Therefore, the 
repair alternativity of the constructed A-HFR code is 11^=1 (P* “ !)■ 

For an (n, k, d) DSS that implements FIFR codes, the repair alternativity of each node can reach a 
high level as node storage capacity and packet repetition degree increase. 

B. A Novel Design Framework 

In practical storage systems, it is desirable to have the systematic feature of MDS codes, for that the 
original file can be read directly from the uncoded copy without decoding. In this sense, we can always 
adopt systematic MDS code as the outer code. Recall that a DC connecting to any k nodes can recover 
the source file. However, fhe reconsfrucfion overheads can vary over a wide range, depending on which 
specific sef of k nodes have been confacfed. We illusfrafe fhis finding by fhe following example. 

Consider a sysfemafic (10, 6) MDS code wifh 6 systematic packets of indices {1,..., 6} and 4 parity 
packets of indices {7,..., 10} respectively. Each systematic packet is repeated 4 times, while each parity 
packet is only replicated 3 times. We distribute these coded packets over n = 12 nodes according to the 
2-HFR code: 

{1,3, 7}, {1,4,8}, {1,5, 9}, {1,6,10), {2, 6, 7}, {2, 5, 8}, 

{2,3, 9}, {2,4,10), {4, 5, 7}, {3, 6, 8}, {4,6 ,9}, {3, 5,10). 

We label the nodes by A^i,..., N 12 respectively, from left to right and top to bottom. It can be observed 
that any DC who connects to A; = 3 nodes can obtain at least 6 distinct packets, which are adequate to 


September 15, 2015 


DRAFT 


TABLE I 

Simulation Results 


13 


N{s) 

N{p) 

Occurrence Number 

Frequency 

3 

3 

0.363 X 10^ 

0.0363% 

4 

2 

1.636 X 10'' 

0.4909% 

3 

3.273 X 10' 

5 

2 

3.273 X lO-" 

0.4364% 

3 

1.091 X 10' 

6 

1 

0.182 X 10'' 

0.0364% 

3 

0.182 X 10' 


r 
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Fig. 3. All the storage nodes are partitioned into 4 virtual clusters. 


retrieve the source file. However, if he connects to the 3 nodes A^i, and iVg, then he will get all the 
6 systematic packets and no additional operations are needed. While contacting nodes A^i, and A^i 2 , 
he will only obtain 3 systematic packets. Therefore, it is necessary to perform some decoding operations 
on the obtained parity packets, such as matrix inversion and linear combinations. 

Simulation: We simulate the behavior of a DC and pick a set of A; = 3 distinct nodes at random from 
the 12 nodes. Among all the distinct packets obtained from the contacted nodes, we use N{s),N{p) to 
denote the number of systematic packets and parity packets respectively. In the simulation, we randomly 
contact 3 nodes 10® times. From Table H we observe that a DC can obtain all the 6 systematic packets 
(regardless of the number of parity packets) with less than 4% probability. In other words, the DC needs 
to complete the decoding operations with more than 96% probability. 

The simulation results indicate the inefficiency of data reconstruction under the random access model 
adopted in the literature. Motivated by this, we carefully partition the above 12 nodes into 4 virtual clusters 
..., S 4 , where each cluster contains one copy of the systematic packets, as depicted in Figure |3] In 
this sense, a data collector can readily reconstruct the source file without decoding if he connects to any 
one of the four clusters. 
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It is clear that the cluster-hased nodes partition model can reduce the data reconstruction time as com¬ 
pared to the random access model. We note that the partition may allude to some practical implications. 
For instance, the source file may he mainly needed hy clients from several different geographical regions 
in practical systems. If we deploy the nodes from each cluster into each region, the clients can read the 
file in a simple manner. Moreover, since practical deploymenfs always confain a track server storing the 
system metadata, we can write the cluster details as well as the repair table 0 into the metadata, which 
can he read directly to facilitate both data reconstruction and node recovery. 

In general, the n nodes of an HFR code can be partitioned into fe-sized clusters if each cluster contains 
all the M original packets. Suppose that n = tk,t > 2 and the data packets are indexed by {1,...,M}. 

Proposition 9. Given a X-HFR code C = {^i,..., ^n}, let , Ht be a partition of C, where 

Hi = < i < t. Let Di denote the collection of distinct points in Hi, i.e., Di = 6 

If [M] C Di, the k nodes in each Hi can be viewed as a cluster. 

We present now the necessary conditions that an HFR code constructed from GDDs can be partitioned 
with the desirable reconstruction property. Note that there are two types of packets in a systematic MDS 
code, we thus focus on the case that A = 2. We further assume that each node contains exactly one parity 
packet and nodes with the same parity packet are included in the same cluster. 

Consider a systematic MDS code with parameters {aOi + 02 ,a 9 i), where the systematic symbols are 
labeled by a9i} and parity packets by {a9i -t- 1,..., a9i +92} respectively. In our construction, 

we adopt a class of V'-GDDs of type 9f9}. The points from the groups of size 9i give the indices of the 
systematic symbols and points in the group of size 02 will represent parities. We assume that systematic 
symbols are of the higher repetition degrees, which implies that 9i < 02- Since each cluster contains one 
copy of the systematic packets and one distinct parity packet, it is clear that the repetition degree of each 
systematic packet is equal to the number of parity packets, i.e., 

(a - 1)01 + 02 


02 = 


tp — 1 


( 11 ) 


The above 2-HFR codes are constmcted assuming the existence of i/^-GDDs of desired parameters. We 
note that the necessary conditions for V’-GDDs of type 0“02 for small block sizes (i.e., ip = 3,4, 5), are 
given in If23| . If we evaluate all the possible group types, then we have the appropriate designs of small 
order, such as 3-GDD of type {1^3^, 2^4^, 1®5^} and 4-GDD of type 1^4p, etc. 
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VI. Conclusion 

In this paper, we generalize the application of FR codes and propose HFR codes, in which the repetition 
degrees of coded packets are different. We present explicit constructions of HFR codes derived from group 
divisible designs. We show that the constructed HFR codes achieve the system storage capacity under 
random access repair and have multiple repair alternatives upon node failure. Further, we present a novel 
design framework based on HFR codes where storage nodes can be wisely partitioned into several clusters. 
The proposed framework in conjunction with the systematic feature of MDS codes can significantly reduce 
the data reconstruction time. Exploring constructions of HFR codes from other combinatorial structures 
is an interesting direction for future research. 
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